The following datasets will be used in the course:
Real data on 963 passengers on the Titanic.
Real data on 10,000 customers of an airline.
Real data on 150 iris flowers.
Real data on emotions, ideology, and party affiliation as predictors of attitudes towards government action on climate change.
Real data on risk for heart disease.
Real data on Alzheimer’s Disease.
Real neuroimaging data from the Human Connectome Project used to predict scores on a memory test. Note: artificially modified to increase predictive power and make activities more engaging.
Real data on 3,276 different water bodies. Modified to convert Potability from a numeric (dummy-coded) variable into a character variable.
Simulated data on 1,470 fictional employees who either quit their jobs (attrition = yes) or did not (attrition = no).